# English speech transcription

Parakeet Ctc 1.1b
Parakeet CTC 1.1B is an automatic speech recognition model jointly developed by NVIDIA NeMo and Suno.ai, based on the FastConformer architecture with approximately 1.1 billion parameters, supporting English speech transcription.
Speech Recognition English
P
nvidia
14.78k
29
Parakeet Rnnt 1.1b
Parakeet RNNT 1.1B is an automatic speech recognition model jointly developed by NVIDIA NeMo and Suno.ai, based on the FastConformer Transducer architecture with approximately 1.1 billion parameters, supporting English speech transcription.
Speech Recognition English
P
nvidia
13.18k
124
Faster Whisper Base.en
MIT
This is a Whisper base.en model converted based on CTranslate2, used for English speech recognition tasks.
Speech Recognition English
F
Systran
367.44k
4
Stt En Fastconformer Ctc Large
This is a large automatic speech recognition (ASR) model based on the FastConformer architecture, specifically designed for transcribing English speech into text.
Speech Recognition English
S
nvidia
1,001
12
Whisper Tiny.en
ONNX weight version of OpenAI Whisper-tiny.en model, designed for Transformers.js, used for English speech transcription.
Speech Recognition Transformers
W
Xenova
33.10k
11
Stt En Conformer Transducer Xlarge
This is an Automatic Speech Recognition (ASR) model developed by NVIDIA, based on the Conformer-Transducer architecture, with approximately 600 million parameters, specifically designed for English speech transcription.
Speech Recognition English
S
nvidia
496
54
Assignment1 Joane
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR)
Speech Recognition Transformers English
A
Classroom-workshop
22
0
Assignment1 Jack
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
A
Classroom-workshop
24
0
Assignment1 Jane
MIT
s2t-small-librispeech-asr is a speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture.
Speech Recognition Transformers English
A
Classroom-workshop
29
0
Wav2vec2 Large 960h Lv60 Self With Wikipedia Lm
An automatic speech recognition (ASR) system based on Facebook's wav2vec2-large-960h-lv60-self model, improved with an enhanced Wikipedia language model
Speech Recognition Transformers
W
gxbag
15
2
Wav2vec2 Large 960h Lv60 Self 4 Gram
Apache-2.0
Based on Facebook's Wav2Vec2-Large-960h-lv60-self model, enhanced with an English 4-gram language model to improve speech recognition accuracy
Speech Recognition English
W
patrickvonplaten
22
4
Wav2vec2 Base 960h 4 Gram
Apache-2.0
Based on Facebook's Wav2Vec2-Base-960h model, with an added English 4-gram language model to improve automatic speech recognition (ASR) accuracy.
Speech Recognition Transformers English
W
patrickvonplaten
19
0
S2t Small Librispeech Asr
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
S
facebook
10.92k
27
Wavlm Libri Clean 100h Base
An automatic speech recognition model fine-tuned on the LIBRISPEECH_ASR - CLEAN dataset based on microsoft/wavlm-base
Speech Recognition Transformers
W
patrickvonplaten
6,515
1
Wav2vec2 Tiny Random Robust
Apache-2.0
A lightweight automatic speech recognition (ASR) model, based on a randomly initialized version of the Wav2Vec2 architecture, designed for robustness testing.
Speech Recognition Transformers English
W
patrickvonplaten
406
0
Wav2vec2 Large 960h Lv60 Self
Apache-2.0
The Wav2Vec2 large model developed by Facebook, pre-trained and fine-tuned on 960 hours of Libri-Light and Librispeech audio data, using self-training objectives, achieving SOTA results on the LibriSpeech test set.
Speech Recognition English
W
facebook
56.00k
146
Wav2vec2 Base 960h
Apache-2.0
Wav2Vec2 is a self-supervised learning-based speech recognition model developed by Facebook, trained on the LibriSpeech dataset, supporting English speech-to-text tasks.
Speech Recognition Transformers English
W
tommy19970714
19
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase